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An arborescence T is a tree whose edges are directed so that each is directed toward a different 
node. Exactly one node of T, called the root, has no edge of T directed toward it. Let G be any directed 
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is one) a spanning arborescence, with prescribed root, whose edges have maximum (or minimum) 
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Section 1 

A (directed) graph G, for purposes here, is a finite 
set of nodes and a finite set of edges, where each edge 
is said to be directed toward one of the nodes, called 
the front end of the edge, and said to be directed away 
from a different one of the nodes, called the rear end 
of the edge. An edge and each of its ends are said to 
meet. A subgraph of G is a subcollection of its members 
which, under the same incidence relations, is a graph. 
A graph is called connected if it is not empty and its 
members do not partition into two disjoint nonempty 
subgraphs. A polygon is a connected graph Q such 
that each node of Q meets exactly two edges of (J. An 
(elementary uniformly directed) circuit is a polygon 
which contains one edge directed toward, and one 
edge directed away from, each of its nodes. A forest 
is a graph which contains no polygon. A tree is a 
connected forest. A branching is a forest whose edges 
are directed so that each is directed toward a different 
node. An arborescence is a connected branching. An 
(elementary uniformly directed) path P is an arbor- 
escence such that each edge in P is directed away 
from a different node, and such that there is at least 
one edge in P. 

We shall occasionally use "obvious" facts about 
graphs without justifying them. 

Clearly, a branching (forest) is the union of a unique 
family of disjoint arborescences (trees). 

Exactly one node in an arborescence 7\ called the 
root of 7 1 , has no edge of T directed toward it. A 
branching (forest) is an arborescence (tree) if and 
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only if it has exactly one less edge than nodes. No 
branching (forest) has more edges than this. 

In a path P there are exactly two nodes, called the 
ends of P, which each meet only one edge in P. The 
rest of the nodes in P each meet exactly two edges in 
P. A path P is said to go from the node which is only 
a rear end in P (the root of P) to the node which is 
only a front end in P. For any arborescence 7\ and any 
node v in T except the root, there is a unique path in 
T going from the root to v. Any path in T going to v 
and any path in T going from v have only v in common, 
and their union is a path. And so on. 

Section 2 

Let G be any graph with a real numerical weight 
Cj corresponding to each edge ejeG. The problem 
treated here is to find in G a branching B which has 
maximum total weight, Xq, summed over ejeB. B is 
called an optimum branching in G. 

First we show that certain variations of the problem 
reduce immediately to it. 

A spanning subgraph of G is a subgraph which 
contains all the nodes of G. A branching in G is a 
spanning arborescence of G if and only if the number 
of its edges is one less than the number of nodes in G. 
No branching in G can have more edges than this. 

An optimum branching in G of course contains no 
edge with negative weight, and indeed may be empty 
if all Cj ^ 0. Even if all cj > and G contains a spanning 
arborescence, an optimum branching in G need not 
be an arborescence. 

If there is a spanning arborescence T in G, then an 
optimum one, i.e., one which has maximum total 
weight, Lcj, ejeT, can be found as an optimum branch- 
ing in G where the edges carry new weights 
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Cj = Cj + h, h > 2 | Cj | , ej-eG. 

A spanning arborescence in G which is optimum rela- 
tive to weights q, e/cG, is also optimum relative to 
weights Cj + k, e^G, for any constant k, since every 
spanning arborescence has the same number of edges. 
Constant h is larger than the difference in total 
weights (relative to weights q, e/eG) of any two branch- 
ings in G. It follows that an optimum branching in 
G, relative to weights cj = q + h, will be a branching 
with a maximum number of edges. In particular, it 
will be a spanning arborescence if and only if G 
contains a spanning arborescence. 

A spanning arborescence T in G which has minimum 
total weight, 2c ; , ejeT, is the same as one which has 
maximum total weight Xcj, ejeT, relative to weights 

Cj=-Cj. 

It will be evident that the efficiency of the method 
for treating optimum branchings is not seriously 
effected by a large change h (say of the form 10") in 
all the weights. In fact the method is easily modified 
to treat optimum spanning arborescences directly. 

If there is a spanning arborescence in G which is 
rooted at a prescribed node, say r, then an optimum 
one can be found by finding an optimum spanning 
arborescence in the graph G' obtained from G by 
adjoining a new edge e ( > (carrying arbitrary weight Co) 
which is directed toward r and directed from a new 
node having no other incident edges. Clearly, T is a 
spanning arborescence in G which is rooted at rif and 
only if T together with e is a spanning arborescence 
of G'. 

If the edges in graph G represent the links for pos- 
sible direct communication from one node to another, 
if each q is the cost of direct communication from the 
rear end of ej to the front end of c/, and if cost is 
additive, then a minimum-total-weight spanning ar- 
borescence rooted at prescribed node r represents 
the least costly way to have a message communicated 
from r to all other nodes of G. 

Another application is where it is desired to ar- 
range an institution into an optimum heirarchy 
(branchocracy). 
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take n to be the maximum of these two numbers or 
to be the vector consisting of both numbers. 

The proof of Theorem 1 is constructive. The theorem 
is proved by displaying one particular algorithm for 
optimum branching which is obviously good. 

If we remove from the optimum-spanning-arbores- 
cence problem the condition that each member of the 
set T of edges being optimized must have a different 
front end, then we get the optimum-spanning-tree 
problem. That is to find, if there is one, in any graph 
G with a numerical weight on each edge, a spanning 
tree which has maximum (or minimum) total weight. 

Especially simple algorithms are well-known for 
this problem [cf. 5 and 6]. 1 One is, starting with an 
empty bucket, build up a set of elements having 
"admissible structure" by putting elements into the 
bucket one after another as long as possible, so that 
each addition is a maximum weight element among 
those not in the bucket which, together with the ones 
already in the bucket, would preserve admissible 
structure. For the optimum-spanning-tree problem, 
the elements are the edges of G and "admissible" 
means "forest." The algorithm is certainly good. It 
is also valid for that problem. 

Where "admissible" means "branching," the above 
algorithm is not generally valid for finding an optimum 
spanning arborescence. Paper [3] abstractly charac- 
terizes those structures for which this "greedy al- 
gorithm" is valid for any numerical weighting. 

If we add to the conditions of the optimum-spanning- 
arborescence problem the condition that each member 
of the set of edges being optimized is to have a dif- 
ferent rear end, then we have the problem of finding, 
if there is one, an optimum spanning (uniformly 
directed) path in any graph G with a numerical weight 
on each edge. This is a version of the well-known 
traveling saleman problem [cf. 4]. I conjecture that 
there is no good algorithm for the traveling saleman 
problem. My reasons are the same as for any mathe- 
matical conjecture: (1) It is a legitimate mathematical 
possibility, and (2) I do not know. 

A good algorithm is known for finding, in any graph 
with a numerical weight on each edge, a maximum- 
total-weight subset of edges such that no two of them 
meet the same node [1, 2]. The treatment here of 
optimum branchings is similar. 



Our main result is 

THEOREM 1. There exists a good algorithm for find- 
ing, in any graph G with a numerical weight corre- 
sponding to each edge, an optimum branching. 

We say an algorithm is good if there is a polynomial 
function f(n) which, for every positive-integer valued 
n, is an upper bound on the "amount of work" the 
algorithm does for any input of "size" n. The concept 
is easy to formalize — relative, say, to a Turing machine, 
or relative to any typical digital computer with an 
unlimited supply of tape. 

For optimum branching, the largest number of 
significant digits in an edge weight, as well as the 
number of edges of G, must be figured somehow into 
the measure n of input "size." One might for example 



Section 4 

Here is the algorithm for finding a maximum-total 
weight branching in any (directed) graph G with a 
numerical weight q on each edge ejeG. Recall that 
a branching is a forest such that each edge is directed 
toward a different node. 

Begin the algorithm by applying instruction (I 1) 
where G* is G° = G and where D i and E l are empty 
buckets, D° and E°. 

(I 1) Choose a node v in G' and not in D\ Put v into 
bucket D\ If there is in G* a positively weighted edge 
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directed toward v, put one of them having maximum 
weight into bucket E\ 

Repeat (1 1) until 
(a) E i no longer comprises the edges of a branching in 
G 1 , or until (b) every node of G [ is in D\ and E l does 
comprise the edges of a branching. When case (a) 
occurs, apply (1 2). 

For convenience assume that every branching which 
we consider in graph G l contains all the nodes of G l . 
We say that a set of edges in G l forms the unique 
subgraph of G 1 consisting of those edges and all 
nodes in G\ 

Each edge e put into E' according to (I 1) is directed 
toward a node v which is the root of a connected 
component of the branching, say /?, formed by the 
edges in E l before e is put into EK If the rear end v 6 
of e is in a different component of B than v, then B U e 
is a branching, and so when e is put into E\ (a) does not 
hold. 

If v$ is in the same component of B as v, then B 
contains a unique path P going from v to v«. In this 
case, Q i = PUe is a circuit contained in J9Ue, so as 
soon as e is put into E\ (a) does hold. 

(I 2) Store Q* and a specification of one of the edges, 
say e^, of Q i which has minimum weight in Q l relative 
to the edge-weights for G*. Obtain a new graph G i+1 
from G' by "shrinking" to a single new node, v[ +1 , 
the circuit Q* and every edge of G { which has both 
ends in QK The edges (denoted as ej +1 ) of G i+1 are 
those edges (denoted as ej) of G' which have at most 
one end in Q\ Every edge of G ? which has one end in 
Q' will in G' + 1 have v\ +1 at that end. All other edge- 
ends are the same in G' + 1 as in G'. The nodes of Q l 
are not in G i + l . 

Every edge, say e| +1 , which as e^ in G j is directed 
toward a node, say v\, in Q* and directed away from a 
node not in Q\ gets a possibly different weight for G i + 1 : 



~i + 1 — r i _|_ n i — n x 
-3 c 3 ' c c 4 



(1) 



where c\ is the weight of e\ for G 1 '; where c§ is the 
minimum weight for G { of an edge, say ej, in Q l \ and 
where c\ is the weight for G [ of the unique edge, say 
e\, which is in Q l and directed toward i>|. AH other 
edges in G 7 + l keep the same weight as for G l . 

In justifying the algorithm we shall make use of the 
following relations 



(2) 



0, (3) 



cj, and (4) c\ 



Put into bucket D i+l the nodes which are in both 
G l + 1 and bucket D\ (Do not at this point put v\ +l into 
D i + 1 .) Put into bucket E i+l the edges which are in 
both G 141 and bucket E\ i.e., put into bucket E i + 1 
the final contents of bucket E j minus the edges of 
circuit Q 1 . It is easy to see that the edges in bucket 
E i + 1 form a branching in G i + l . Continue the algorithm 
by applying (1 1) where i is one greater. 

Eventually, after a small number of applications of 
(1 1) and (I 2), case (b) must occur. 



As soon as (b) occurs, for say i = k, (II) and (12) 
are never applied again. Instead, (I 3) is applied suc- 
cessively for £-f 1 = &, k— 1, . . ., 1, until the graph 
G i obtained is the original G. At that point, the branch- 
ing B l = B° is a maximum-total- weight branching of G. 

The final contents of bucket E k form a branching 
in graph G k which we call B k . 

(13) It is not difficult to see that since B i + l is a 
forest in G ? + 1 and since G , + 1 is obtained from G { 
by shrinking the circuit Q l in G* (and all edges of G { 
with both ends in Q*) to the node v\ +l of G i + \ the 
subgraph H i of G ? , formed by the edges in Z?* 41 and 
the edges in Q i contains only one polygon, namely Q'. 

In the case where v\+ l is not a root of (a connected 
component of) branching B ,-+1 in G /+1 , there is a 
unique edge, say ej +1 , of B i + A which is directed 
toward v\ +{ . In G% e\ is directed toward a node, say 
v l 2 , of Q l , Since Q* h a circuit, there is a unique edge, 
say ej, of Q* which is directed toward v\. Clearly, e\ and 
ei are the only two edges of H l which are directed 
toward the same node. Thus, since Q' is in the only 
polygon of H\ deleting el from //' yields a branching 
in G\ which is called B\ 

In the case where v\ +] is a root of branching B i + l 
in G < + 1 , i.e., where no edge of fi , ' + 1 is directed toward 
v[ + \ no two edges of//' are directed toward the same 
node. Therefore, deleting any edge of Q j from //' 
yields a branching in G'. To obtain the branching /?' 
in G', delete from //' one of the edges e l of Q i which 
has minimum weight C(j. 

That completes the description of the algorithm. 
Evidently it is a good algorithm. Evidently its output 
is a branching B° in graph G. In order to prove The- 
orem 1, what remains to be done is prove that B {) has 
maximum total weight. 



Section 5 

Theorem 1 and the following geometric theorem are 
proven together. 

Let G be any graph. (No edge-weights are specified.) 
Let there be a real variable Xj for each edge e,eG. Let 
Pa be the polyhedron of vectors x = [xj] which satisfy 
the system La, consisting of inequalities L u L 2 , and 

u 

(Lt) For every edge e,-eG, Xj ^ 0. 

(L 2 ) For every node veG, Sxj^I, summed over all 

/s such that ej is directed toward v. 
(L 3 ) For every set S of two or more nodes in G, 



Xxj ■ 



1, 



summed over all /s such that ej has both ends in 
S. (\S\ denotes the cardinality of S.) 

Any vector x = [xj] of zeroes and ones is called the 
(incidence) vector of the subset of e/s such that Xj= 1. 

THEOREM 2. The vertices of polyhedron P G are pre- 
cisely the vectors of the subsets of edges in G which 
comprise branchings. 
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A polyhedron {convex polyhedron) P is the set of all 
the vectors, i.e., points, which satisfy some finite 
system L of linear inequalities. A vertex (extreme 
point) of P is a point which, for some linear function, 
is the unique point in P which maximizes that function. 

A basic point x = x° of a finite system L of linear 
inequalities is the unique solution of a system, 
jXaijXj = bi, iel, such that jXaijXj ^ bi, iel, is a sub- 
system of L. 

If basic point x° of L is in the polyhedron P of L, 
then it is a vertex of P, because clearly x° is then the 
unique point in P which maximizes j^d^a^Xj, iel. 

We shall see without difficulty that any point x°, 
which is the vector of a branching say B° in G, is a 
vertex of Pa. Vector x° satisfies L x since it is all zeroes 
and ones. Vector x° satisfies L 2 for any node veG, 
since, by the definition of branching, at most one of 
the x/s in this inequality has value 1 for x°. 

The branching B° is a forest, so any set S of nodes, 
together with the subset E% of the edges in B° which 
have both ends in S forms a forest. The number of 
edges in a forest is at most the number of nodes in 
the forest minus 1; in particular, \E$\ ^ \S\ — 1. There- 
fore, vector x° satisfies L 3 for any subset S of (two 
or more) nodes in G, since \E§\ of the x/s in this 
inequality have the value 1 for x°. Summarizing the 
conclusion so far, x° is a point in Pa. 

Vector x°is the unique solution of the linear system: 
Xj = for every edge ej not in B°, and Xxj= 1 (summed 
over e/s directed toward v) for every node v which 
has some edge of B° directed toward it. This system 
can be obtained from certain of the relations of L\ 
and L 2 by replacing their inequality signs. Therefore 
x° is a basic point of L G , and hence a vertex of Pg. 

Most of this paper is directed toward proving: 

Lemma 1: Every linear function, ScjXj (summed 
over all edges ejeG), is maximized in P G by the vector 
of some branching in G. 

From Lemma 1 and from the definition of vertex, 
it follows immediately that every vertex of Pa is the 
vector of a branching in G. This will conclude the 
proof of Theorem 2. 

A branching B° in graph G has maximum total weight 
relative to the vector c = [cj] of edge- weights if and 
only if the vector x°=[x ( jj of B° maximizes (c, x) = j^cjXj 
over all vectors of branchings in G. If x° maximizes 
(c, x) over Pa, then it maximizes (c, x) over the vectors 
of branchings in G, since the latter are in Pa. 

Our task, therefore, is to show that the vector of 
the branching B°, produced by the algorithm, maxi- 
mizes (c, x) over Pa- This will prove that the algorithm 
is valid and will prove Lemma 1. 



Xy] ^ for every 77, and 
riXagnXr) ^ \b% for every £, 
and that y=[yd is any vector which satisfies 
y% ^ for every £, and 
iXa&yz ^ cr) for every 77. 
Since (6) and (7) imply 

fXivXa&x^yt ^ &beye= (6, y) , 
and since (5) and (8) imply 

rj%({£a&y{)xri ^ ril t Cr i Xr}= (c, %) , 

we have 

(c,x) ^ (6,y). 



(5) 
(6) 

(7) 
(8) 

(9) 
(10) 
(ID 



Since (11) holds for any x and any y, if (c, x°) = (b, y°) 
holds for particular x — x^ and y=y°, then x° must 
maximize (c, x) and y° must minimize (b, y). 

Suppose for particular x = x l and y—y l that 

r)Xa^-nXr,= bg for £ such that y\ ¥" 0, (12) 



and 



g%a&yl = c-q for 17 such that Xt) ¥^ 0. (13) 



Since (12) implies equality in (9), and (13) implies 
equality in (10), we have (c, x 1 ) = (b, y 1 ). Therefore, 



and 



x 1 maximizes (c, x) 



y 1 minimizes (b, y). 



(14) 



Our present interest is where (5) is (Li), and (6) is 
(L 2 ) and (L 3 ). For any linear function (c, x)=j%CjXj of 
points xePa, we get a dual system (7), (8), (b, y), by 
letting a variable y^ correspond to each inequality 
of L 2 and L 3 . That is let a variable yn correspond to 
each node VhtG and let a variable y s correspond to 
each set S of two or more nodes in G. 

For (7) we have, 



for every Vh, yn^ o, 



(15) 



and 



Section 6 



for every S, y s ^ 0. 



(16) 



The following computations are well-known in Coefficient «/ij— 1 if edge ej is directed toward 

linear programming. Suppose that je = [%T)] is any node !>&, and ahj = otherwise. Coefficient a S j = 1 
vector which satisfies if edge ej has both ends in S, and a S j = otherwise. 
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For every v h , bh=l. For every S, b 8 = \S\- 
Therefore, (8) becomes 



1. 



for every edge ejeG, 
jh + Wj ^ Cj, where v\ x is the front end 
of ej, and where w, = 2y s , summed over 
all sets S which contain both ends of ej. 

Function (6, y) becomes 



(17) 



(6,y)=*Sy* + .2(|S|-l)y., 

summed over all Vh and over all S. 

Recall that our task is to show that the vector x° 
of the branching B°, produced by the algorithm, 
maximizes (c, x) over Pa. 

In view of (14), we do so by constructing a vector 
y = [yh, ) r s] which satisfies (15), (16), (17), and which 
satisfies (12) and (13). For the present system, (12) is 

for every node v\ x such that y/, ¥" 0, 

X*j — 1, summed over /s such that ej 

is directed toward vn\ (18) 

and 

for every set S such that y s # 0, 

£*j— \S\ — 1, summed overj's 

such that €j has both ends in S. (19) 

In other words (18) says that if », ¥" then an edge 
of the branching B° is directed toward Vh, and (19) 
says that if y s ^ then exactly | S \ — 1 edges of B° 
have both ends in S. 

For the present system (13) is 

for every edge ej in the branching Z?°, 

yh + Wj = Cj, where vu and Wj are as in (17). (20) 



Section 7 

For each graph G l (i= k, k — 1, . . ., 0) with 
weight c) on each edge e]eG\ and for the branching 
B l in G 1 , we will describe a vector y { which satisfies 
(15)-(20), where G and B° are replaced by G 1 and B { 
and where vector y is y\ 



First we describe a y k 

y i + 1 (i = k- 



and then, assuming a 
1, . . ., 0), 



: y 



we describe a y\ Thus by induction we obtain a y- 
and the proof of Theorems 1 and 2. 

The vector y k —\y k l , y k ] is y*=0 for every set S of 
two or more nodes in G k , y£=0 for every node v k in 
G* which has no edge of B k directed toward it, and, 
for every other node v§ in G k , yji = cf where edge ejf 
of B k is directed toward v%. Conditions (15)-(20) for 
y k can be immediately verified from the fact that for 
every node vf { eG k either there is no edge of B k directed 
toward v% and there is no positively weighted edge 



directed toward v§, or else, among all the positively 
weighted edges directed toward vjjf, the one in B k has 
maximum weight. 

Now, suppose that we have a y}, + 1 for each node 
v\t l and a yi +1 for each set S of two or more nodes in 
G /+1 , such that (15)-(20) are satisfied (where B l) is 
replaced by Z?' +1 , etc.). 

Let 4 +1== 2y| +1 , summed over the sets S which 
contain node ^j, +1 . 

To make the induction go through we assume fur- 
ther that in G i+1 



for every node Vh, such that th + yu > 0, 
there exists at least one edge ej directed 
toward vu such that q = th + yn- 



(21) 



This clearly holds for G A , and we will prove from 
(15)-(21) for G M that (15)-(21) holds for G\. 

Obtain the vector y j as follows: 

Where A is the set of nodes in circuit Q' of G', where 
eli is the edge of Q' not in B\ where i4 is the front end of 
ej, where cj) is the minimum weight in Q\ and where 
^U 1 is the node in G' +1 to which Q i was shrunk, let 



and 



yj = 34 +1 + c£-ci, 



yi = ci-yi-ti +1 . 



(22) 



(23) 



Where i/J is any node in A other than i4, and where 
e\ is the edge in Q l which is directed toward v l 3 , let 



yi = ci-y!t-n +1 . 



(24) 



Observe that (24) holds also for p£= i4. 

Where i^|is any node of G* which is not in Q\ let 



yh = yk"- 



(25) 



Where /? is a nonempty subset of nodes in G' +1 
which does not contain v{ +1 , where J = RUv[ +i , where 
K = RUA, and where L is any set of two or more nodes 
in G' such that LDA is a proper subset of A, let 



yk-- 
yk-- 



yit 



■-XT 



yi=0. 



(26) 

(27) 

(28) 



That completes the description of vector y'. Now we 
must verify (15)— (21) for it. 

For every edge of G* which is directed toward a node 
not in A, for every node not in A, and for every set S, 
except A, in G\ conditions (15) -(18), (20), and (21) 
follow immediately from those same conditions for 
y ,+1 , (25) — (28), and the local nature of the change 
from G /+1 , fi |-+1 , and c M to G\ B\ and c*. 
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For every subset of nodes in G i which does not con- 
tain all of A, condition (19) follows immediately as 
above. For set A and for every set K as in (27), condi- 
tion (19) follows from (27), condition (19) for set J 
in G i+1 , and the fact that there are exactly 

\K\-\J\=\A\-l 

more edges of B l with both ends in K than there are 
edges of B i+1 with both ends in J, namely the edges of 

It follows from (24), (27), and (28), that (21) holds 
for every node v^ in A (in particular where e, is the e' 4 of 
(24)), and that (20) holds for every edge of B' l C\Q\ and 
that (17) holds for el 

Condition (18) follows immediately for each node of 
A except v{ since there is an edge of B^Q 1 directed 
toward it. If there is an edge e\ +1 in B i+1 which is di- 
rected toward v\ +1 , then e{ is an edge of B' which is 
directed toward *>|, and so in this case (18) follows for 
v[. Otherwise, if there is no edge of B i+l directed 
toward v[ + \ then by (18) for v\ + \ yi +1 = 0. Also in this 
case, the c& of (22) was chosen in the algorithm to be c&. 
Therefore, if there is no edge of B j+1 directed toward 
i4 +1 , then (22) is yj = 0, and so (18) follows for vl 

For ef, the only edge, if any, which is in B' — Q* and 
directed toward a node in A, we have 

c[ +1 = c\ + cl-ci (from (1)), (22), y[ +1 + w\ +l = c[ +1 

which is (20) for e[ + \ and w{ = w[ +1 from (27) and (28). 
Combining these we get yi + w[ = c/, which is (20) 
for e\. 

Thus conditions (18), (19), (20), and (21) are now 
completely accounted for. Condition (17) for edges not 
in Q i but directed toward nodes in A, condition (16) 
for y l A , and condition (15) for nodes in A, remain to be 
verified. 

Let el be any edge of G l which has both ends in A, 
and let vi be its front end. To prove (17) for ej, which is 
yi + wi^ci where wi = y\-\- 1\ +1 , combine (24) and 
ci ^ cl 

Let ei be any edge of G i which has its front end vi 
in A and its rear end not in A. To prove condition (17) 
for e?, which is yi + wi^ci where wi = w l 3 +1 , combine 
(24), (23), (22), (1), and (17) for e| +1 . 

To prove (16) for A, that is y A ^ 0, we use (21) for 
i4 +1 . Assuming t[ +1 + y\ +l >0, let e& +1 be the e- 3 of that 
relation, let v\ be the front end of e\ in A, and let e] 
be the edge of Q i which is directed toward vi Here (21) 
is c i 3 +1 = t[ +1 + yi +1 . In this case, obtain y A ^ by com- 
bining (23), (22), (21) for v[ + \ (1), and (4). 

If there is no 4 +1 directed toward v\ +1 such that 
4 +1=r 4 +1 + M +1 , then t\ +1 -\-y\ +l = 0, and all edges 
directed toward v\ +1 have negative weight in G l +\ so 
none of them are in B i+1 . Therefore since in this case 
the c| of (22) was chosen to be cj, (22) becomes 34 = 0, 
and (23) becomes y A = c l . By (2), we have y\ ^0. 

Prove (15) for any node v\ in A by combining (24), 
(23), (22), (3), and y\ +1 ^ 0. 

That completes the proof of Theorems 1 and 2. 



Section 8 

Notice from the proof that if every weight Cj, ejeG, 
is an integer, then the vector y°, as well as vector x°, 
is integer-valued. In particular, where every c/=l, 
vector y° is 0,1-valued and max(c, x) = min(b, y) is a 
simple "Konig-type" theorem, analogous to the maxi- 
mum-cardinality-matching duality theorem in [1]. 

The following two theorems can be proved by the 
methods used here. 

Theorem 3. Where (L 4 ) is S^j=n, summed over all 
edges ejeG, the vertices of the polyhedron given by 
(Li), (L 2 ), (L 3 ), and (L 4 ) are precisely the vectors of the 
n-cardinality subsets of edges in G which comprise 
branchings. (In particular, where n is one less than the 
number of nodes in G, these branchings are the span- 
ning arborescences ofG). 

The present research began when A. J. Goldman 
asked for a description of "the convex hull of the 
spanning trees of a graph." Theorem 4 is proved in [3]. 
THEOREM 4. The vertices of the polyhedron F G given by 
(Li) and (L 3 ) are precisely the vectors of the subsets of 
edges in G which comprise forests. The vertices of the 
intersection of F G with (L 4 ) are a subset of the vertices 
of Fa. 



Section 9 

Figures (A) through (E) illustrate the algorithm for 
finding an optimum (i.e., maximum total weight) 
branching in graph (A). Each dashed edge is dashed 
because it has maximum positive weight among those 
edges directed toward its front end. As soon as a 
dashed circuit arises it is shrunk and certain new edge 
weights are computed, thereby producing the edge- 
weighted graph of the next figure. The final graph is 
not drawn since it is simply a node. After the sequence 
of figures is completed, except for the boldness of the 
bold edges, then working backwards through the 
sequence, appropriate dashed edges are made bold. 
The answer is the branching in (A) formed by the bold 
edges. It is the only correct answer. 

Figure (F) illustrates a "dual answer," y, for the 
same problem. The numbers in the squares on the 
nodes are the values of the nodal y-variables. The 
numbers in the squares on the closed curves are the 
nonzero values of y-variables corresponding to sub- 
sets of nodes. Each closed curve encloses the subset 
of nodes to which its number corresponds. Observe 
that the vector y, thus represented, satisfies relations 
(15)-(20), and thus guarantees that the branching is 
optimum. This y is not the one described in section 8. 

The example (A)-(E) was actually obtained by con- 
structing it all except the edge-weights first. Then all 
the numbers in (F) were chosen so as to yield the 
structure (A)-(E). 

Figure (G), by coincidence, nicely represents three 
different things. 

It illustrates a nonoptimum branching obtained by 
applying the greedy algorithm. This branching has 
total weight 128, whereas the branching shown in (A) 
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has total weight 131. One other branching that might 
be obtained by applying the greedy algorithm has total 
weight 127. 

Since the optimum branching shown in (A) is a 
spanning arborescence, it is an optimum spanning 
arborescence, and thus it is still an optimum spanning 
arborescence when all of the edge-weights are changed 
by adding any constant. However, when all the edge- 
weights are changed by adding the constant, — 10, the 
branching shown in (A) is no longer an optimum 
branching, even though all the weights in this branch- 
ing are still positive, and even though this branching 
is still an optimum spanning arborescence. For these 
new edge- weights there are two optimum branchings, 
quite different from each other, and both quite differ- 
ent from the branching shown in (A). Neither one of 
them is a spanning arborescence. One of them con- 
sists of all the bold edges in (G) except for the edge 
which is weighted —5 (relative to the new weights). 
This branching has total weight, 53, whereas the 
branching shown in (A) has total weight, 51. 

A direct algorithm for finding, if there is one, an 
optimum spanning arborescence is obtained from the 
algorithm for finding an optimum branching simply by 
deleting the words "positively weighted" from (II). 
This follows from the fact that the only effect on the 
resulting algorithm of adding a constant to each num- 
ber of an input is to add the same constant to every 
number that arises in the algorithm. 

Because of the words "positively weighted" in the 
optimum branching algorithm, the effect of adding 
— 10 to each edge-weight in (A) is that the resulting 
application of the optimum branching algorithm does 
not dash the edge that is weighted —5, and does not 
dash in (D) the edge that is weighted — 1. Thus there is 
no shrinking in (D); the sequence of graphs stops at (D). 
In (D), only the edge that is weighted 7 is made bold 
since it is the only one that gets dashed. (The present 
computation is not explicitly illustrated.) Unlike in the 
spanning arborescence problem, we have in (C) no 
bold edge directed toward a node in the dashed circuit. 
Therefore, the edges of the dashed circuit in (C), 
except for one or the other of its minimum-weight 
edges, are made bold. The choice here is what gives 
rise to the two correct answers. It is interesting to 
note that the two edges which are tied in this step of 
the computation do not have the same edge-weights 
in (A), and that the two optimum branchings, arising 



from the two choices, are globally quite different. I 
recommend carrying through the completion of each. 
Figure (G) also illustrates an optimum spanning 
arborescence having the lower left node prescribed as 
root. The first phase of the algorithm for obtaining it is 
the dashing and shrinking and computing of new edge 
weights just as in (A)-(E). The only difference from 
(A)-(E) is the way the edges are chosen from among the 
dashed ones to be made bold. An optimum spanning 
arborescence rooted at any other prescribed node is 
obtained from this same first phase of (A)-(E) by 
appropriately choosing edges from among the dashed 
ones. The subgraph of (A), formed by the image in (A) 
of all edges dashed somewhere in (A)-(E), in general 
contains nonoptimum spanning arborescences with 
prescribed root as well as optimum ones. Therefore, 
the choosing does depend on the structure of the 
sequence (A)-(E). In general, it is a nice feature of 
the computations for finding in the same edge-weighted 
graph, when they exist, optimum spanning arbores- 
censes with various prescribed roots, that these compu- 
tations are identical except for the (13) part. 



This paper was to have appeared in the published proceedings 
of the International Seminar on Graph Theory and Its Applications, 
Rome, July 1966, sponsored by the International Computation Cen- 
ter. Various international failures of communication during the 
editorial process precluded it. I am sorry to have lost that oppor- 
tunity to record my contribution to an outstanding symposium. I 
wish to acknowledge here my appreciation to the organizers of the 
symposium for the excellent job they did and for their kindness 
to me. 
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